1.  INTRODUCTION


Various test procedures in multivariate analysis are based upon certain
functions of the eigenvalues of random matrices. A considerable amount of
work has been done in the literature on the asymptotic distribution theory
of these statistics when the sample size is very large. But many situations
arise in multivariate data analysis in which the number of variables and the
sample size are both very large. So there is a great need to investigate the
distributions of various functions of the eigenvalues of large dimensional
random matrices. Distributions of the eigenvalues of large dimensional
random matrices also arise in nuclear physics (e.g., see Mehta (1967)).

Some work has been done in the literature on the limiting empirical
distribution function (e.d.f.) of large dimensional random matrices. Here,
we note that the e.d.f. of a random matrix $Z: p \times p$ is defined as
$N(x)/p$, where $N(x)$ denotes the number of the eigenvalues of $Z$ which
are less than or equal to $x$. The e.d.f. (also known as the spectral
distribution) of $Z$ is useful in deriving the distributions of certain
functions of the eigenvalues of $Z$.

Now, let $S_1: p \times p$ be distributed as a central Wishart matrix with
$m$ degrees of freedom and $E(S_1/m) = I_p$. Also, let $p$ and $m$ both tend
to infinity such that $\lim(p/m) = y > 0$. Then, it is known (see Grenander
and Silverstein (1977), Jonsson (1982) and Wachter (1978)) that the e.d.f.
$F_m(x)$ of the eigenvalues of $S_1/m$ tends to $F_y(x)$, where $F_y(x)$ is
a distribution function with density function given by

$$f_y(x) = \begin{cases} \dfrac{\sqrt{(x-a)(b-x)}}{2\pi x y}, & a < x < b \\ 0, & \text{otherwise} \end{cases}$$

where $a = (1 - \sqrt{y})^2$ and $b = (1 + \sqrt{y})^2$ and $0 < y \le 1$;
for $1 < y < \infty$, $F_y(x)$ has mass $1 - (1/y)$ at zero and density
$f_y(x)$ on $(a,b)$. Yin and Krishnaiah (1983b) showed that the spectral
distribution of the sample covariance matrix has a limit when the underlying
distribution is isotropic and $y < 1$.
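The mass statement above can be checked numerically. The following is a minimal sketch, not part of the original derivation: it assumes the limiting density has the form $\sqrt{(x-a)(b-x)}/(2\pi x y)$ on $(a,b)$, and the names `mp_density` and `mp_integral` are illustrative only. The integral uses the substitution $x = c + r\cos\theta$, which removes the square-root behaviour at the endpoints.

```python
import math

def mp_density(x, y):
    """Assumed limiting density sqrt((x-a)(b-x)) / (2*pi*x*y) on (a, b), zero elsewhere."""
    a = (1 - math.sqrt(y)) ** 2
    b = (1 + math.sqrt(y)) ** 2
    if not (a < x < b):
        return 0.0
    return math.sqrt((x - a) * (b - x)) / (2 * math.pi * x * y)

def mp_integral(y, power=0, n=4000):
    """Midpoint rule for the integral of x**power * mp_density(x, y) over (a, b).

    Uses x = c + r*cos(theta), theta in (0, pi), so dx = r*sin(theta) d(theta).
    Intended for y != 1, where a > 0.
    """
    a = (1 - math.sqrt(y)) ** 2
    b = (1 + math.sqrt(y)) ** 2
    c, r = (a + b) / 2, (b - a) / 2
    h = math.pi / n
    total = 0.0
    for i in range(n):
        theta = (i + 0.5) * h
        x = c + r * math.cos(theta)
        total += x ** power * mp_density(x, y) * r * math.sin(theta)
    return total * h

if __name__ == "__main__":
    # Continuous mass should be 1 for y < 1 and 1/y for y > 1.
    print(mp_integral(0.5), mp_integral(4.0))
```

For $y > 1$ the continuous part carries mass $1/y$, which leaves $1 - 1/y$ for the atom at zero, in agreement with the text.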




Yin and Krishnaiah (1983a) showed that the spectral distribution of
$S_1T/m$ tends to a limit in probability for each $x$ under the following
conditions:

(a)  $T$ is a symmetric, positive definite matrix and $G_p(x)$ is the e.d.f.
of the eigenvalues of $T$,

(b)  $S_1$ and $T$ are independent of each other,

(c)  $\lim(p/m) = y$ exists and is finite,

(d)  j*x^dGp(x)  — +  exists  in  for  k  «  1,  2,...  and  "  “* 

Yin and Krishnaiah (1984b) extended the above result to the case when $S_1$
is the sample sums of squares and cross products matrix based upon
observations from an isotropic population.

Now, let $S_1: p \times p$ and $S_2: p \times p$ be distributed
independently as central Wishart matrices with $m$ and $n$ degrees of
freedom and $E(S_1/m) = E(S_2/n) = I_p$. Then $nS_1S_2^{-1}/m$ is known as
the central multivariate F matrix. Applying the result of Yin and Krishnaiah
(1983), Yin, Bai, and Krishnaiah (1983) showed that the limit of the
spectral distribution of the central multivariate F matrix exists when
$\lim(p/m) = y$ exists and $(p/n) \to z < 1/2$ as $p \to \infty$.
Silverstein (1984b) showed the validity of the above result even for the
case $1/2 \le z < 1$ by making a minor modification in the proof of Yin,
Bai, and Krishnaiah (1983). Yin and Krishnaiah (1983a) gave an expression
for the moments of the limit of the e.d.f. of the eigenvalues of $S_1T/m$.
Starting from this expression, Silverstein (1984a) derived an explicit
expression for the limit of the e.d.f. of the eigenvalues of the
multivariate F matrix. In this paper, the authors give an alternative
derivation of the above limit. The authors also give explicit expressions
for the moments of the above limiting distribution; these expressions are
not known in the literature. Extensions of the results of this paper to
nonnormal populations and nonnull cases are under investigation.


2.  PRELIMINARIES 


In  this  section,  we  give  some  results  which  are  needed  in  the  sequel  as 
well  as  a  brief  review  of  known  results  on  limiting  spectral  distribution 

of  a  multivariate  F  matrix. 

Lemma 2.1  Let $z \in (0,1)$, $a' = (1 - \sqrt{z})^2$ and
$b' = (1 + \sqrt{z})^2$. If $|t| < a'$, then

$$\frac{1}{2\pi z}\int_{a'}^{b'} \frac{\{(x-a')(b'-x)\}^{1/2}}{x-t}\,dx = \frac{1}{2z}\left[1 + z - t - \{(1-z-t)^2 - 4tz\}^{1/2}\right]. \tag{2.1}$$

Proof  Making the transformation $u = [2x - (b' + a')]/(b' - a')$ in the
left side of (2.1), we obtain

$$R(t) = \frac{1}{2\pi z}\int_{a'}^{b'} (x-t)^{-1}\{(x-a')(b'-x)\}^{1/2}\,dx = \frac{(b'-a')}{4\pi z}\int_{-1}^{1}\frac{(1-u^2)^{1/2}}{u+A}\,du \tag{2.2}$$

where $A = (b' + a' - 2t)/(b' - a')$. It is known (see Jonsson (1982))
that $R(0) = 1$. So, for any $r \in (0,1)$, we have

$$\frac{1}{\pi\sqrt{r}}\int_{-1}^{1}\frac{(1-u^2)^{1/2}}{u + ((1+r)/2\sqrt{r})}\,du = 1. \tag{2.3}$$

Now, let $A = (1 + r)/2\sqrt{r}$, so that $\sqrt{r} = A - (A^2-1)^{1/2}$.
Since $A > 1$, the condition $r \in (0,1)$ is satisfied. So, using (2.2)
and (2.3), we obtain

$$R(t) = (b' - a')\left(A - \sqrt{A^2 - 1}\right)/4z = \left[1 + z - t - \{(1-z-t)^2 - 4tz\}^{1/2}\right]/2z. \tag{2.4}$$
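As an illustration of Lemma 2.1, the closed form can be compared with a direct numerical evaluation of the integral. This is only a sketch under the reading of (2.1) as $\frac{1}{2\pi z}\int_{a'}^{b'} \{(x-a')(b'-x)\}^{1/2}/(x-t)\,dx = [1+z-t-\{(1-z-t)^2-4tz\}^{1/2}]/2z$; the function names `r_numeric` and `r_closed` are hypothetical.

```python
import math

def r_numeric(t, z, n=4000):
    """Midpoint-rule value of (1/(2*pi*z)) * int_{a'}^{b'} sqrt((x-a')(b'-x)) / (x-t) dx.

    With x = c + r*cos(theta) the square root becomes r*sin(theta) and
    dx contributes another factor r*sin(theta).
    """
    a = (1 - math.sqrt(z)) ** 2
    b = (1 + math.sqrt(z)) ** 2
    c, r = (a + b) / 2, (b - a) / 2
    h = math.pi / n
    total = 0.0
    for i in range(n):
        theta = (i + 0.5) * h
        x = c + r * math.cos(theta)
        total += (r * math.sin(theta)) ** 2 / (x - t)
    return total * h / (2 * math.pi * z)

def r_closed(t, z):
    """Closed form on the right side of (2.1), valid for |t| < a'."""
    return (1 + z - t - math.sqrt((1 - z - t) ** 2 - 4 * t * z)) / (2 * z)

if __name__ == "__main__":
    z = 0.3  # here a' = (1 - sqrt(0.3))**2, roughly 0.2046
    for t in (-0.15, 0.0, 0.1):
        print(t, r_numeric(t, z), r_closed(t, z))
```

At $t = 0$ both expressions reduce to $R(0) = 1$, the normalisation used in the proof.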


Lemma 2.2  For any nonnegative integers $m$ and $w$, we have

$$\sum_{\ell=0}^{[w/2]} (-1)^{\ell}\binom{m}{\ell}\binom{2m-2\ell}{2m-w} = 2^{w}\binom{m}{w}. \tag{2.5}$$

Here $\binom{m}{w}$ is defined to be zero when $m < w$.

Proof  If $m = 0$, the proof is trivial. We now prove the result for $m > 0$ by
induction. Suppose the result is true for a fixed value of $m$. Then, we have
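The identity of Lemma 2.2 is easy to test exhaustively for small arguments. The sketch below assumes the lemma reads $\sum_{\ell=0}^{[w/2]} (-1)^{\ell}\binom{m}{\ell}\binom{2m-2\ell}{w-2\ell} = 2^{w}\binom{m}{w}$, where $\binom{2m-2\ell}{w-2\ell}$ equals $\binom{2m-2\ell}{2m-w}$ whenever both are defined; `lemma_lhs` and `lemma_rhs` are illustrative names.

```python
from math import comb

def lemma_lhs(m, w):
    """Left side of the assumed identity (2.5).

    Terms with l > m vanish because comb(m, l) = 0, so the loop stops at
    min(w // 2, m); this also keeps every argument of comb nonnegative.
    """
    return sum(
        (-1) ** l * comb(m, l) * comb(2 * m - 2 * l, w - 2 * l)
        for l in range(min(w // 2, m) + 1)
    )

def lemma_rhs(m, w):
    """Right side 2**w * C(m, w); math.comb already returns 0 when m < w."""
    return 2 ** w * comb(m, w)

if __name__ == "__main__":
    ok = all(lemma_lhs(m, w) == lemma_rhs(m, w) for m in range(10) for w in range(10))
    print("identity holds on the tested grid:", ok)
```

One way to see why the identity should hold: both sides are the coefficient of $x^w$ in $\{(1+x)^2 - x^2\}^m = (1+2x)^m$.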


3.  MOMENTS  OF  THE  LIMITING  SPECTRAL  DISTRIBUTION 
OF  A  MULTIVARIATE  F  MATRIX 

Let $F_{y,z}(x)$ denote the limit of the e.d.f. of the multivariate F matrix
as defined in the preceding section. Also, let $\{E_k\}_{k=1}^{\infty}$
denote the moments of $F_{y,z}(x)$. Then, from Yin and Krishnaiah (1983), we
know that

$$E_k = \sum_{w=0}^{k-1} \frac{1}{w+1}\binom{k}{w} y^{w} B(k,w) \tag{3.1}$$

where

$$B(k,w) = \sum \frac{(w+1)!}{n_1! \cdots n_{k-w}!}\, H_1^{n_1} \cdots H_{k-w}^{n_{k-w}} \tag{3.2}$$

and the summation in (3.2) is over all possible values of
$n_1, \ldots, n_{k-w}$ subject to the restrictions
$n_1 + \cdots + n_{k-w} = w + 1$ and
$n_1 + 2n_2 + \cdots + (k-w)n_{k-w} = k$. Also, $H_i = E(x^{-i})$ for
$i = 1, 2, \ldots$ where the density of $x$ is given by

$$g_z(x) = \begin{cases} \dfrac{\sqrt{(x-a')(b'-x)}}{2\pi z x}, & a' < x < b' \\ 0, & \text{otherwise} \end{cases} \tag{3.3}$$

where $a'$, $b'$, and $z$ are as defined in the preceding section. For any
$|t| < a'$,

$$\begin{aligned}
\varphi(t) &= tH_1 + t^2H_2 + t^3H_3 + \cdots \\
&= E(t/(x-t)) \\
&= \frac{1}{2\pi z}\int_{a'}^{b'} \left[t\{(x-a')(b'-x)\}^{1/2}/x(x-t)\right]dx \\
&= (1-z-t)\left[1 - \{1 - 4tz/(1-z-t)^2\}^{1/2}\right]/2z
\end{aligned} \tag{3.4}$$

by using Lemma 2.1. Since $B(k,w)$ is the coefficient of $t^k$ in the Taylor
expansion of $(\varphi(t))^{w+1}$, we obtain

$$B(k,w) = (2z)^{-(w+1)}\sum_{\ell=0}^{w+1}(-1)^{\ell}\binom{w+1}{\ell}\sum_{j=0}^{k}\binom{\ell/2}{j}(-4z)^{j}\binom{w+1-2j}{k-j}(1-z)^{w+1-j-k}(-1)^{k-j}. \tag{3.5}$$

If $w + 1 - 2j \ge 0$, then $\binom{w+1-2j}{k-j} \ne 0$ implies
$0 \le j \le w + 1 - k \le 0$, i.e. $j = 0$. Noticing
$\sum_{\ell=0}^{w+1}(-1)^{\ell}\binom{w+1}{\ell} = 0$, we see that the
expression of $B(k,w)$ only contains the terms with $j > \frac{w+1}{2}$.
If $\ell$ is even, then $\binom{\ell/2}{j} \ne 0$ implies
$j \le \frac{\ell}{2} \le \frac{w+1}{2}$, which is contrary to
$j > \frac{w+1}{2}$. Thus, in the expression of $B(k,w)$, there are only the
terms with $j > \frac{w+1}{2}$ and $\ell$ being odd. Applying Lemma 2.2, we
obtain

$$\begin{aligned}
B(k,w) &= -(2z)^{-(w+1)}\sum_{\ell=0}^{[w/2]}\binom{w+1}{2\ell+1}\sum_{(w+1)/2 < j \le k}\binom{\ell+\frac12}{j}(-4z)^{j}\binom{w+1-2j}{k-j}(-1)^{k-j}(1-z)^{w+1-j-k} \\
&= 2^{-w}\sum_{\ell=0}^{[w/2]}\binom{w+1}{2\ell+1}\sum_{(w+1)/2 < j \le k}(-1)^{\ell}\frac{(2\ell+1)!\,(2j-2-2\ell)!}{j!\,\ell!\,(j-\ell-1)!}\binom{k+j-w-2}{k-j} z^{j-w-1}(1-z)^{w+1-j-k} \\
&= \sum_{j=w+1}^{k}\frac{(k+j-w-2)!\,(w+1)}{j!\,(j-w-1)!\,(k-j)!}\, z^{j-w-1}(1-z)^{w+1-j-k} \\
&= \sum_{j=0}^{k-w-1}\frac{(k+j-1)!\,(w+1)}{(j+w+1)!\,j!\,(k-j-w-1)!}\, z^{j}(1-z)^{-k-j}.
\end{aligned} \tag{3.6}$$


From (3.1) and (3.6) it follows that

$$\begin{aligned}
E_k &= \sum_{w=0}^{k-1}\binom{k}{w} y^{w}\sum_{j=0}^{k-w-1}\frac{(k+j-1)!}{j!\,(j+w+1)!\,(k-j-w-1)!}\, z^{j}(1-z)^{-k-j} \\
&= \sum_{j=0}^{k-1} z^{j}(1-z)^{-k-j}\sum_{w=0}^{k-1-j}\binom{k}{w} y^{w}\frac{(k+j-1)!}{j!\,(j+w+1)!\,(k-j-w-1)!}.
\end{aligned} \tag{3.7}$$

The k-th moment of $F_{y,z}(\cdot)$ can be easily computed from (3.7).
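To illustrate how the moments can be computed, the following sketch evaluates the double sum in exact rational arithmetic. It assumes (3.7) reads $E_k = \sum_{w=0}^{k-1}\binom{k}{w}y^w\sum_{j=0}^{k-w-1}\frac{(k+j-1)!}{j!(j+w+1)!(k-j-w-1)!}z^j(1-z)^{-k-j}$; the function name `moment` is hypothetical.

```python
from fractions import Fraction
from math import comb, factorial

def moment(k, y, z):
    """k-th moment E_k from the assumed form of (3.7), computed exactly.

    y and z may be Fractions (or anything Fraction() accepts); z must lie in (0, 1).
    """
    y, z = Fraction(y), Fraction(z)
    total = Fraction(0)
    for w in range(k):
        for j in range(k - w):
            coeff = Fraction(
                factorial(k + j - 1),
                factorial(j) * factorial(j + w + 1) * factorial(k - j - w - 1),
            )
            total += comb(k, w) * y ** w * coeff * z ** j * (1 - z) ** (-k - j)
    return total

if __name__ == "__main__":
    y, z = Fraction(1, 2), Fraction(1, 3)
    print(moment(1, y, z))  # first moment, equal to 1/(1 - z)
    print(moment(2, y, z))  # second moment, equal to (1 + y(1 - z))/(1 - z)**3
```

Under this reading the first two moments simplify to $E_1 = 1/(1-z)$ and $E_2 = \{1 + y(1-z)\}/(1-z)^3$, which gives a quick consistency check on any implementation.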


4.  DERIVATION  OF  THE  LIMITING  SPECTRAL  DISTRIBUTION 
OF  MULTIVARIATE  F  MATRIX 


Substituting the well-known formula

$$\binom{k}{w} = \sum_{t=0}^{w}\binom{k-j-w-1}{t}\binom{j+w+1}{w-t} \tag{4.1}$$

into (3.7), we obtain the following by changing the order of summation:

$$\begin{aligned}
E_k &= \sum_{j=0}^{k-1} z^{j}(1-z)^{-k-j}\sum_{t=0}^{[(k-j-1)/2]}\frac{(k+j-1)!\; y^{t}(1+y)^{k-j-1-2t}}{(k-j-1-2t)!\,t!\,(j+t+1)!\,j!} \\
&= \sum_{j=0}^{k-1} z^{j}(1-z)^{-k-j}\sum_{t=j}^{[(k+j-1)/2]}\frac{(k+j-1)!\; y^{t-j}(1+y)^{k+j-1-2t}}{(k+j-1-2t)!\,(t-j)!\,j!\,(t+1)!} \\
&= \sum_{t=0}^{k-1}\binom{2t}{t}\frac{1}{t+1}\sum_{j=0}^{t} z^{j}(1-z)^{-k-j}\binom{t}{j}\binom{k+j-1}{2t}\, y^{t-j}(1+y)^{k+j-1-2t}.
\end{aligned} \tag{4.2}$$

Using the formulas

$$\binom{k+j-1}{2t} = \sum_{s=0}^{2t}\binom{k-1}{s}\binom{j}{2t-s}, \qquad \binom{t}{j}\binom{j}{2t-s} = \binom{t}{s-t}\binom{s-t}{t-j} \tag{4.3}$$

in (4.2), we obtain

$$\begin{aligned}
E_k &= \sum_{t=0}^{k-1}\binom{2t}{t}\frac{1}{t+1}\sum_{s=t}^{2t}\binom{k-1}{s}\sum_{j=0}^{t}\binom{t}{j}\binom{j}{2t-s} z^{j}(1-z)^{-k-j}\, y^{t-j}(1+y)^{k+j-1-2t} \\
&= \sum_{t=0}^{k-1}\binom{2t}{t}\frac{1}{t+1}\sum_{s=t}^{2t}\binom{k-1}{s}\binom{t}{s-t}\sum_{j=0}^{s-t}\binom{s-t}{j} z^{t-j}(1-z)^{-k-t+j}\, y^{j}(1+y)^{k-1-t-j} \\
&= \sum_{t=0}^{k-1}\binom{2t}{t}\frac{1}{t+1}\sum_{s=t}^{2t}\binom{k-1}{s}\binom{t}{s-t} z^{2t-s}(1-z)^{-k-t}(1+y)^{k-s-1}(y+z)^{s-t} \\
&= \sum_{t=0}^{k-1}\binom{2t}{t}\frac{1}{t+1}\sum_{s=0}^{t}\binom{k-1}{s+t}\binom{t}{s} z^{t-s}(1-z)^{-k-t}(1+y)^{k-t-s-1}(y+z)^{s}
\end{aligned} \tag{4.4}$$

$$= \sum_{s=0}^{[(k-1)/2]}\sum_{t=0}^{k-1-2s}\binom{k-1}{k-1-2s}\binom{k-1-2s}{t}\frac{(2s)!\,(2s+2t)!}{s!\,(2s+t)!\,(s+t+1)!}\, z^{t}(1-z)^{-k-t-s}(1+y)^{k-2s-1-t}(y+z)^{s}. \tag{4.5}$$


Define a random vector $(U, V)$ where the marginal density of $U$ is
$\frac{2}{\pi}(1-x^2)^{-1/2} I_{(0,1)}(x)$ and the conditional density of
$V$ given $U = x$ is

$$\frac{|v|}{1-x^2}\, I_{[-\sqrt{1-x^2},\ \sqrt{1-x^2}]}(v). \tag{4.6}$$

It is easy to see that

$$E\{U^{2s}V^{2t+2s}\} = \frac{(2s)!\,(2t+2s)!}{s!\,(2s+t)!\,(s+t+1)!}\, 4^{-2s-t} \tag{4.7}$$

and that

$$E\{U^{2s+1}V^{2t+2s+1}\} = 0.$$

Hence, from (4.5), (4.7) it follows that

$$\begin{aligned}
E_k &= E\sum_{s=0}^{k-1}\sum_{t=0}^{k-1-s}\binom{k-1}{k-1-s}\binom{k-1-s}{t} z^{t}(1-z)^{-k-t-s/2}(1+y)^{k-s-t-1}(y+z)^{s/2}\, 4^{s+t}\, U^{s}V^{2t+s} \\
&= \frac{1}{(1-z)^{k}}\, E\left[1 + y + \frac{4V^2 z}{1-z} + \frac{4UV\sqrt{y+z}}{\sqrt{1-z}}\right]^{k-1}.
\end{aligned} \tag{4.8}$$

Now, we compute the distribution of

$$\left[1 + y + \frac{4V^2 z}{1-z} + \frac{4UV\sqrt{y+z}}{\sqrt{1-z}}\right]\Big/(1-z). \tag{4.9}$$


Let $w_1 = UV$ and $w_2 = V^2$. Then we can easily show that the joint
density of $(w_1, w_2)$ is

$$\frac{1}{\pi}\, w_2\,(w_2 - w_1^2)^{-3/2}\, I_{[w_1^2 + w_2^2 \le w_2]}$$

where $I_{[\alpha \le \beta]}$ takes value 1 if $\alpha \le \beta$ is true
and zero otherwise. Applying Lemma 2.3, we can compute the density of
$\frac{4\sqrt{y+z}}{\sqrt{1-z}}\, w_1 + \frac{4z}{1-z}\, w_2$ as follows:

Let $q_1 < q_2$ be the two roots of the equation

$$\frac{(1-z)^2}{16z^2}\, q^2 + \frac{1-z}{16(y+z)}\,(x-q)^2 = \frac{1-z}{4z}\, q \tag{4.10}$$

in  the  variable  q.  Let  a  *  l,  8=0,  r  *  {(1  -  z)/16(y  +  z)}x^,  6  *  ^  z 

and  p  =  refr-Hr  •  Then 


4z  8(y  +  z) 
1  -  z  2
x  , 


4(y  +  z) 


.2  .  +  2a5  .  +  „ 


A,  -  S2  +  4yp  .  <1  -  «>  [jr  +  *  +  *ll 

16zZ(y  +  z) 


By Lemma 2.3, we get the density of
$\frac{4\sqrt{y+z}}{\sqrt{1-z}}\, w_1 + \frac{4z}{1-z}\, w_2$ as given below:*

$$f_1(x) = \frac{1}{\pi}\,\frac{\sqrt{1-z}}{4\sqrt{y+z}}\,\frac{(1-z)^2}{16z^2}\int_{q_1}^{q_2} q\left[\frac{1-z}{4z}\, q - \frac{1-z}{16(y+z)}\,(x-q)^2\right]^{-3/2} dq \quad {}^{**}$$


_  (1  -  z) 


i1  +  A2q 


64irz  /y  +  z  A., /( 1  -  z)  1  -  z  ^  7i 

V~rr^ q  ~  i4(y  t  ,)<»- 1> 


•  (4.11) 


Since  q^  and  q2  are  roots  of  equation  (4.10),  we  have 


*  Use the formula $f_{x+y}(x) = \int f(x-q, q)\,dq$ where $f_{x+y}(\cdot)$
and $f(\cdot,\cdot)$ are the densities of $x+y$ and $(x,y)$ respectively.

**  Note that the integrand is zero outside the interval $[q_1, q_2]$ by the
indicator factor.


J. 


64ttz  /y  +  z  A 
,  5/ 2 


(1  -  z)' 


3  4z  M  |l*l 

(“V  q2  -  qi 


64ttz  /y  +  z  A 


i  -  z  q,q 


3  4z 


1M2 


(4.12) 


From (4.10) we can compute

$$q_1 q_2 = \frac{z^2 x^2}{y + z - yz} \tag{4.13}$$

$$q_2 - q_1 = \frac{2z\sqrt{(y+z)\left(-x^2(1-z) + 4xz + 4(y+z)\right)}}{y + z - yz}. \tag{4.14}$$
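The two relations above follow from writing (4.10) as a quadratic in $q$. As a small check, and only as a sketch, one can multiply (4.10) by $16z^2(y+z)/(1-z)$ to get $(y+z-yz)q^2 - \{2z^2x + 4z(y+z)\}q + z^2x^2 = 0$ and solve it numerically; the names `roots_410` and `residual_410` are illustrative.

```python
import math

def roots_410(x, y, z):
    """Roots q1 <= q2 of (4.10), rewritten as A*q**2 + B*q + C = 0."""
    A = y + z - y * z
    B = -(2 * z * z * x + 4 * z * (y + z))
    C = z * z * x * x
    disc = B * B - 4 * A * C  # equals 4 z^2 (y+z) (-x^2 (1-z) + 4 x z + 4 (y+z))
    s = math.sqrt(disc)
    return (-B - s) / (2 * A), (-B + s) / (2 * A)

def residual_410(q, x, y, z):
    """Left side minus right side of (4.10) in its original form."""
    lhs = (1 - z) ** 2 / (16 * z * z) * q * q + (1 - z) / (16 * (y + z)) * (x - q) ** 2
    rhs = (1 - z) / (4 * z) * q
    return lhs - rhs

if __name__ == "__main__":
    x, y, z = 1.0, 0.5, 0.3
    q1, q2 = roots_410(x, y, z)
    p = -x * x * (1 - z) + 4 * x * z + 4 * (y + z)
    print(q1 * q2, z * z * x * x / (y + z - y * z))                    # (4.13)
    print(q2 - q1, 2 * z * math.sqrt((y + z) * p) / (y + z - y * z))   # (4.14)
```

The discriminant is positive exactly when $-x^2(1-z) + 4xz + 4(y+z) > 0$, which is the range of $x$ on which two distinct roots exist.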


From (4.12) - (4.14), it follows that

$$\begin{aligned}
f_1(x) &= \frac{(1-z)^{5/2}}{64\pi z^2\sqrt{y+z}}\cdot\frac{\left[x^2(1-z)/4(y+z)\right]\, 2z\sqrt{(y+z)\left(-x^2(1-z) + 4xz + 4(y+z)\right)}}{\dfrac{(1-z)^2[y+z+xz]}{16z^2(y+z)}\cdot\dfrac{1-z}{4z}\cdot z^2x^2} \\
&= \frac{\sqrt{1-z}\,\sqrt{-x^2(1-z) + 4xz + 4(y+z)}}{2\pi(y+z+xz)}.
\end{aligned} \tag{4.15}$$

From this we can easily obtain the density of

$$\left[1 + y + \frac{4V^2 z}{1-z} + \frac{4UV\sqrt{y+z}}{\sqrt{1-z}}\right]\Big/(1-z)$$

as

$$f_2(x) = \frac{(1-z)\sqrt{-(1-z)^2x^2 + 2(1+y+z-yz)x - (1-y)^2}}{2\pi(xz+y)} = \frac{(1-z)^2\sqrt{(x-a)(b-x)}}{2\pi(xz+y)} \tag{4.16}$$

where $a = \dfrac{(1 - \sqrt{y+z-yz})^2}{(1-z)^2}$ and
$b = \dfrac{(1 + \sqrt{y+z-yz})^2}{(1-z)^2}$.

Since $f_1(x) \ne 0$ iff equation (4.10) has two different roots, we find by
checking the steps of computation that $f_2(x) \ne 0$ iff $a < x < b$.
Recalling (4.9), we obtain


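$$E_k = \frac{1}{1-z}\int_a^b x^{k-1} f_2(x)\,dx. \tag{4.17}$$

*  Note that
$\frac{1-z}{4z}\, q - \frac{1-z}{16(y+z)}\,(x-q)^2 = \left(\frac{1-z}{4z}\, q\right)^2$
for $q = q_1$ or $q_2$.

Now, let

$$f(x) = \frac{f_2(x)}{x(1-z)} = \begin{cases} \dfrac{(1-z)\sqrt{(x-a)(b-x)}}{2\pi x(y+xz)}, & \text{if } a < x < b \\ 0, & \text{otherwise.} \end{cases} \tag{4.18}$$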
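The density in (4.18) can be cross-checked against the low-order moments. The sketch below assumes $f(x) = (1-z)\sqrt{(x-a)(b-x)}/\{2\pi x(y+xz)\}$ on $(a,b)$ with $a, b = \{(1 \mp \sqrt{y+z-yz})/(1-z)\}^2$, and integrates $x^k f(x)$ with the substitution $x = c + r\cos\theta$; the names `f_limits` and `f_moment` are hypothetical.

```python
import math

def f_limits(y, z):
    """Support endpoints a < b of the assumed density (4.18)."""
    h = math.sqrt(y + z - y * z)
    return ((1 - h) / (1 - z)) ** 2, ((1 + h) / (1 - z)) ** 2

def f_moment(k, y, z, n=4000):
    """Midpoint-rule value of the integral of x**k * f(x) over (a, b).

    With x = c + r*cos(theta): sqrt((x-a)(b-x)) = r*sin(theta) and
    dx = r*sin(theta) d(theta). Intended for y != 1, where a > 0.
    """
    a, b = f_limits(y, z)
    c, r = (a + b) / 2, (b - a) / 2
    step = math.pi / n
    total = 0.0
    for i in range(n):
        theta = (i + 0.5) * step
        x = c + r * math.cos(theta)
        total += x ** (k - 1) * (r * math.sin(theta)) ** 2 / (y + x * z)
    return (1 - z) * total * step / (2 * math.pi)

if __name__ == "__main__":
    y, z = 0.5, 0.3
    print(f_moment(1, y, z), 1 / (1 - z))                          # first moment
    print(f_moment(2, y, z), (1 + y * (1 - z)) / (1 - z) ** 3)     # second moment
```

Agreement of the first two moments with $1/(1-z)$ and $\{1+y(1-z)\}/(1-z)^3$ is consistent with the moment formula of Section 3.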


In [7], it is shown that the distribution $F_{y,z}$ is determined by all its
moments. From (4.11), (4.12) it follows that

$$F_{y,z}(x) = \gamma\, I_{(0,\infty)}(x) + \int_{-\infty}^{x} f(u)\,du, \tag{4.19}$$

where $\gamma = 1 - \int_{-\infty}^{\infty} f(x)\,dx$.


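Finally, we only need to compute the integral $\int_a^b f(x)\,dx$. If we set
$u = [2x - (b + a)]/(b - a)$, we get

$$\begin{aligned}
I = \int_a^b f(x)\,dx &= \frac{(1-z)}{2\pi y}\left[\int_a^b \frac{\sqrt{(x-a)(b-x)}}{x}\,dx - \int_a^b \frac{\sqrt{(x-a)(b-x)}}{x + \frac{y}{z}}\,dx\right] \\
&= \frac{(1-z)(b-a)}{4\pi y}\left[\int_{-1}^{1}\frac{\sqrt{1-u^2}}{u+\lambda_1}\,du - \int_{-1}^{1}\frac{\sqrt{1-u^2}}{u+\lambda_2}\,du\right]
\end{aligned}$$

where $\lambda_1 = \dfrac{b+a}{b-a}$ and
$\lambda_2 = \dfrac{b+a}{b-a} + \dfrac{2y}{z(b-a)}$. Using (2.2) we get

$$\begin{aligned}
I &= \frac{(1-z)(b-a)}{4y}\left[\left(\lambda_1 - \sqrt{\lambda_1^2 - 1}\right) - \left(\lambda_2 - \sqrt{\lambda_2^2 - 1}\right)\right] \\
&= \frac{1-z}{2y}\left[\sqrt{ab + \frac{y}{z}(a+b) + \frac{y^2}{z^2}} - \sqrt{ab} - \frac{y}{z}\right] \\
&= \frac{1-z}{2y}\left[\frac{y+z}{z(1-z)} - \frac{|1-y|}{1-z} - \frac{y}{z}\right] \\
&= \frac{1}{2y}\left[1 + y - |1-y|\right] \\
&= \begin{cases} 1 & \text{if } 0 < y \le 1 \\ \dfrac{1}{y} & \text{if } y > 1. \end{cases}
\end{aligned}$$

Hence

$$\gamma = \begin{cases} 0 & \text{if } 0 < y \le 1 \\ 1 - \dfrac{1}{y} & \text{if } y > 1. \end{cases}$$

Substituting this into (4.19), we get the expression of $F_{y,z}(x)$.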
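The value of the atom at zero can be confirmed numerically. This is a minimal sketch assuming the density (4.18) and the closed form $I = \{1 + y - |1-y|\}/2y$; the names `continuous_mass`, `closed_mass` and `atom_at_zero` are illustrative.

```python
import math

def continuous_mass(y, z, n=4000):
    """Numerical value of I = integral of f over (a, b) for the assumed density (4.18).

    Uses x = c + r*cos(theta); intended for y != 1 so that a > 0.
    """
    h = math.sqrt(y + z - y * z)
    a = ((1 - h) / (1 - z)) ** 2
    b = ((1 + h) / (1 - z)) ** 2
    c, r = (a + b) / 2, (b - a) / 2
    step = math.pi / n
    total = 0.0
    for i in range(n):
        theta = (i + 0.5) * step
        x = c + r * math.cos(theta)
        total += (r * math.sin(theta)) ** 2 / (x * (y + x * z))
    return (1 - z) * total * step / (2 * math.pi)

def closed_mass(y):
    """Closed form (1 + y - |1 - y|) / (2y): 1 for y <= 1 and 1/y for y > 1."""
    return (1 + y - abs(1 - y)) / (2 * y)

def atom_at_zero(y):
    """gamma = 1 - I, the mass that F_{y,z} places at zero."""
    return 1 - closed_mass(y)

if __name__ == "__main__":
    for y in (0.5, 2.0, 4.0):
        print(y, continuous_mass(y, 0.3), closed_mass(y), atom_at_zero(y))
```

Note that the result does not depend on $z$: the atom at zero is governed by $y$ alone, as in the Wishart case.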


ADDED  IN  PROOF 


After this paper was typed, the following interesting paper was brought
to the attention of the authors:

Wachter, K. W. (1980). The limiting empirical
measure of multiple discriminant ratios. Annals
of Statistics, 8, 937-957.

In the above paper, Wachter gave an explicit expression for the e.d.f. of
$(XJX')(XX')^{-1}$ in the limiting case when $p \to \infty$, $J$ is a
projection matrix and the columns of $X: p \times (n+m)$ are distributed
independently as multivariate normal with mean vector 0 and covariance
matrix $I$. When

$$J = \begin{pmatrix} I_m & 0 \\ 0 & 0 \end{pmatrix}$$

we observe that $(XJX')(XX')^{-1} = (X_1X_1')(X_1X_1' + X_2X_2')^{-1}$ where
$X = (X_1, X_2)$. The random matrix considered by us is equivalent to
$(X_1X_1')(X_2X_2')^{-1}$ and so our result on the limiting e.d.f. can be
obtained from the result of Wachter. But the method used by us is entirely
different from the method used by Wachter (1980). Also, we derived the
moments of the above distribution and these expressions are new.